PAGE: 1 RAW SEQUENCE LISTING DATE: 06/23/98

PATENT APPLICATION US/09/096,648 TIME: 07:42:03 

INPUT SET: S26847.raw 



This Raw Listing contains the General 
Information Section and up to the first 5 pages. 



1 SEQUENCE LISTING

3 (1) General Information:

4 

5 (i) APPLICANT: Hadlaczky, Gyula 

6 Szalay, Aladar 
7 

8 (ii) TITLE OF INVENTION: ARTIFICIAL CHROMOSOMES, USES THEREOF AND 

9 METHODS PREPARING ARTIFICIAL CHROMOSOMES 
10 

11 (iii) NUMBER OF SEQUENCES: 12 
12 

13 (iv) CORRESPONDENCE ADDRESS: 

14 (A) ADDRESSEE: Brown, Martin, Haller & McClain 

15 (B) STREET: 1660 Union Street 

16 (C) CITY: San Diego 

17 (D) STATE: CA 

18 (E) COUNTRY: USA 

19 (F) ZIP: 92101-2926 
20 

21 (v) COMPUTER READABLE FORM: 

22 (A) MEDIUM TYPE: Diskette 

23 (B) COMPUTER: IBM Compatible

24 (C) OPERATING SYSTEM: DOS 

25 (D) SOFTWARE: FastSEQ Version 1.5 
26 

27 (vi) CURRENT APPLICATION DATA: 

28 (A) APPLICATION NUMBER: US/09/096,648

29 (B) FILING DATE: 

30 (C) CLASSIFICATION: 
31 

32 (vii) PRIOR APPLICATION DATA: 

33 (A) APPLICATION NUMBER: 08/629,822 

34 (B) FILING DATE: 10-APR-1996 
35 

36 (viii) ATTORNEY/AGENT INFORMATION: 

37 (A) NAME: Seidman, Stephanie L 

38 (B) REGISTRATION NUMBER: 33,779 

39 (C) REFERENCE/DOCKET NUMBER: 6869-402A 
40 

41 (ix) TELECOMMUNICATION INFORMATION: 

42 (A) TELEPHONE: 619-238-0999 

43 (B) TELEFAX: 619-238-0062 

44 (C) TELEX: 
45 

46 
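
The labelled fields above follow a regular shape: an optional listing line number, a parenthesised tag such as (A) or (ii), a key, a colon, and a value. A minimal sketch of reading one such line follows. The name `parse_field` and the regular expression are illustrative assumptions, not part of any USPTO tool, and lines that do not fit the shape are skipped rather than repaired.

```python
import re

# Assumed field shape: optional listing line number, "(tag)", key, colon, value.
FIELD = re.compile(r"^(?:\d+\s+)?\(([A-Za-z]+)\)\s*([^:]+):\s*(.*)$")

def parse_field(line):
    """Return (tag, key, value) for a labelled field line, or None if it does not fit."""
    match = FIELD.match(line.strip())
    if match is None:
        return None  # blank, numeric-only, or OCR-damaged line
    tag, key, value = match.groups()
    return tag, key.strip(), value.strip()

# Example lines taken from the General Information section.
for text in ["17 (D) STATE: CA", "25 (D) SOFTWARE: FastSEQ Version 1.5", "46"]:
    print(parse_field(text))
```

Values that wrap onto a following line, such as the title of invention, would need to be joined by the caller; this sketch treats each line independently.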





47 (2) INFORMATION FOR SEQ ID NO:1:

48 

49 (i) SEQUENCE CHARACTERISTICS: 

50 (A) LENGTH: 1293 base pairs 

51 (B) TYPE: nucleic acid 

52 (C) STRANDEDNESS: single

53 (D) TOPOLOGY: linear

54 

55 (ii) MOLECULE TYPE: Genomic DNA 

56 (iii) HYPOTHETICAL: NO 

57 (iv) ANTI-SENSE: NO

58 (v) FRAGMENT TYPE:

59 (vi) ORIGINAL SOURCE:

60 (ix) FEATURE:

61 

62 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

63 

64 GAATTCATCA TTTTTCANGT CCTCAAGTGG ATGTTTCTCA TTTNCCATGA TTTTAAGTTT 60 

65 TCTCGCCATA TTCCTGGTCC TACAGTGTGC ATTTCTCCAT TTTNCACGTT TTNCAGTGAT 120 

66 TTCGTCATTT TCAAGTCCTC AAGTGGATGT TTCTCATTTN CCATGAATTT CAGTTTTCTN 180 

67 GCCATATTCC ACGTCCTACA GNGGACATTT CTAAATTTNC CACCTTTTTC AGTTTTCCTC 240 

68 GCCATATTTC ACGTCCTAAA ATGTGTATTT CTCGTTTNCC GTGATTTTCA GTTTTCTCGC 300 

69 CAGATTCCAG GTCCTATAAT GTGCATTTCT CATTTNNCAC GTTTTTCAGT GATTTCGTCA 360

70 TTTTTTCAAG TCGGCAAGTG GATGTTTCTC ATTTNCCATG ATTTNCAGTT TTCTTGNAAT 420 

71 ATTCCATGTC CTACAATGAT CATTTTTAAT TTTCCACCTT TTCATTTTTC CACGCCATAT 480 

72 TTCATGTCCT AAAGTGTATA TTTCTCCTTT TCCGCGATTT TCAGTTTTCT CGCCATATTC 540 

73 CAGGTCCTAC AGTGTGCATT CCTCATTTTT CACCTTTTTC ACTGATTTCG TCATTTTTCA 600 

74 AGTCGTCAAC TGGATCTTTC TAATTTTCCA TGATTTTCAG TTATCTTGTC ATATTCCATG 660 

75 TCCTACAGTG GACATTTCTA AATTTTCCAA CTTTTTCAAT TTTTCTCGAC ATATTTGACG 720 

76 TGCTAAAGTG TGTATTTCTT ATTTTCCGTG ATTTTCAGTT TTCTCGCCAT ATTCCAGGTC 780 

77 CTAATAGTGT GCATTTCTCA TTTTTCACGT TTTTCAGTGA TTTCGTCATT TTTTCCAGTT 840 

78 GTCAAGGGGA TGTTTCTCAT TTTCCATGAG TGTCAGTTTT CTTGCTATAT TCCATGTCCT 900 

79 ACAGTGACAT TTCTAAATAT TATACCTTTT TCAGTTTTTC TCACCATATT TCACGTCCTA 960

80 AAGTATATAT TTCTCATTTT CCCTGATTTT CAGTTTCCTT GCCATATTCC AGGTCCTACA 1020 

81 GTGTGCATTT CTCATTTTTC ACGTTTTTCA GTAATTTCTT CATTTTTTAA GCCCTCAAAT 1080 

82 GGATGTTTCT CATTTTCCAT GATTTTCAGT TTTCTTGCCA TATACCATGT CCTACAGTGG 1140 

83 ACATTTCTAA ATTATCCACC TTTTTCAGTT TTTCATCGGC ACATTTCACG TCCTAAAGTG 1200 

84 TGTATTTCTA ATTTTCAGTG ATTTTCAGTT TTCTCGCCAT ATTCCAGGAC CTACAGTGTG 1260 

85 CATTTCTCAT TTTTCACGTT TTTCAGTGAA TTC 1293 
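
Each residue row above ends with a running total of bases, so the rows can be cross-checked mechanically. The sketch below joins the 10-residue groups and flags any row whose printed total disagrees with the accumulated length. The name `check_rows` and the row pattern are assumptions made for illustration; rows that OCR has damaged (for example a split total) simply fail to match and are skipped.

```python
import re

# Assumed row shape: optional listing line number, residue groups, running total.
ROW = re.compile(r"^(?:\d+\s+)?((?:[ACGTUN]+\s+)+)(\d+)$")

def check_rows(lines):
    """Join residue rows and report rows whose running total is off."""
    residues = ""
    mismatches = []
    for line in lines:
        match = ROW.match(line.strip())
        if match is None:
            continue  # header, blank, or OCR-damaged line: skipped, not repaired
        residues += "".join(match.group(1).split())
        declared = int(match.group(2))
        if declared != len(residues):
            mismatches.append((declared, len(residues)))
    return residues, mismatches

# First row of SEQ ID NO:1 as printed in the listing.
rows = [
    "64 GAATTCATCA TTTTTCANGT CCTCAAGTGG ATGTTTCTCA TTTNCCATGA TTTTAAGTTT 60",
]
sequence, mismatches = check_rows(rows)
print(len(sequence), mismatches)
```

Feeding all rows of one sequence and comparing the final length with the declared LENGTH field gives a second, independent check.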
86 

87 (2) INFORMATION FOR SEQ ID NO: 2: 

88 

89 (i) SEQUENCE CHARACTERISTICS: 

90 (A) LENGTH: 1044 base pairs 

91 (B) TYPE: nucleic acid 

92 (C) STRANDEDNESS: single 

93 (D) TOPOLOGY: linear 
94 

95 (ii) MOLECULE TYPE: Genomic DNA 

96 (iii) HYPOTHETICAL: NO 

97 (iv) ANTI-SENSE: NO 

98 (v) FRAGMENT TYPE:

99 (vi) ORIGINAL SOURCE:

(ix) FEATURE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

AGGCCTATGG TGAAAAAGGA AATATCTTCC CCTGAAAACT AGACAGAAGG ATTCTCAGAA 60

TCTTATTTGT GATGTGCGCC CCTCAACTAA CAGTGTTGAA GCTTTCTTTT GATAGAGCAG 120

TTTTGAAACA CTCTTTTTGT AAAATCTGCA AGAGGATATT TGGATAGCTT TGAGGATTTC 180

CGTTGGAAAC GGGATTGTCT TCATATAAAC CCTAGACAGA AGCATTCTCA GAAGCTTCAT 240

TGGGATGTTT CAGTTGAAGT CACAGTGTTG AACAGTCCCC TTTCATAGAG CAGGTTTGAA 300

ACACTCTTTT TTGTAGTATC TGGAAGTGGA CATTTGGAGC GATCTCAGGA CTGCGGTGAA 360

AAAGGAAATA TCTTCCAATA AAAGCTAGAT AGAGGCAATG TCAGAAACCT TTTTCATGAT 420

GTATCTACTC AGCTAACAGA GTTGAACCTT CCTTTGAGAG AGCAGTTTTG AAACACTCTT 480

TTTGTGGAAT CTGCAAGTGG ATATTTGTCT AGCTTTGAGG ATTTCGTTGG GAAACGGGAT 540

TACATATAAA AAGCAGACAG CAGCATTCCC AGAAACTTCT TTGTGATGTT TGCATTCAAG 600

TCACAGAGTT GAACATTCCC TTTCATAGAG CAGGTTTGAA ACACACTTTT TGATGTATCT 660

GGATGTGGAC ATTTGCAGCG CTTTCAGGCC TAAGGTGAAA AGGAAATATC TTCCCCTGAA 720

AACTAGACAG AAGCATTCTC AGAAACTTAT TTGTGATGTG CGCCCTCAAC TAACAGTGTT 780

GAAGCTTTCT TTTGATAGAG GCAGTTTTGA AACACTCTTT TGTGGAATCT GCAAGTGGAT 840

ATTTGTCTAG CTTTGAGGAT TTCTTTGGAA ACGGGATTAC ATATAAAAAG CAGACAGCAG 900

CATTCCCAGA ATCTTGTTTG TGATGTTTGC ATTCAAGTCA CAGAGTTGAA CATTCCCTTT 960

CAGAGAGCAG GTTTGAACAC TCTTTTTATA GTATCTGGAT GTGGACATTT GGAGCGCTTT 1020

CAGGGGGGAT CCTCTAGAAT TCCT 1044



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2492 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(ix) FEATURE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

CTGCAGCTGG GGGTCTCCAA TCAGGCAGGG GCCCCTTACT ACTCAGATGG GGTGGCCGAG 60 

TAGGGGAAGG GGGTGCAGGC TGCATGAGTG GACACAGCTG TAGGACTACC TGGGGGCTGT 120 

GGATCTATGG GGGTGGGGAG AAGCCCAGTG ACAGTGCCTA GAAGAGACAA GGTGGCCTGA 180 

GAGGGTCTGA GGAACATAGA GCTGGCCATG TTGGGGCCAG GTCTCAAGCA GGAAGTGAGG 240 

AATGGGACAG GCTTGAGGAT ACTCTACTCA GTAGCCAGGA TAGCAAGGAG GGCTTGGGGT 300 

TGCTATCCTG GGGTTCAACC CCCCAGGTTG AAGGCCCTGG GGGAGATGGT CCCAGGACAT 360 

ATTACAATGG ACACAGGAGG TTGGGACACC TGGAGTCACC AAACAAAACC ATGCCAAGAG 420 

AGACCATGAG TAGGGGTGTC CAGTCCAGCC CTCTGACTGA GCTGCATTGT TCAAATCCAA 480 

AGGGCCCCTG CTGCCACCTA GTGGCTGATG GCATCCACAT GACCCTGGGC CACACGCGTT 540 

TAGGGTCTCT GTGAAGACCA AGATCCTTGT TACATTGAAC GACTCCTAAA TGAGCAGAGA 600 

TTTCCACCTA TTCGAAACAA TCACATAAAA TCCATCCTGG AAAAAGCCTG GGGGATGGCA 660





153 CTAAGGCTAG GGATAGGGTG GGATGAAGAT TATAGTTACA GTAAGGGGTT TAGGGTTAGG 720

154 GATCAACGTT GGTTAGGAGT TAGGGATACA GTAGGGTACC GGTAGGGTTA GGGGTTAGGG 780 

155 TTAGGGGTTA GGGTTAGGGT TAGGGTTAGG GTTAGGGTTA GGGGTTAGGG GTTAGGGTTA 840

156 GGGTTAGGTT TTGGGGTGGC GTATTTTGGT CTTATACGCT GTGTTCCACT GGCAATGAAA 900 

157 AGAGTTCTTG TTTTTCCTTC AGCAATTTGT CATTTTTAAA AGAGTTTAGC AATTCTAACA 960

158 GATATAGACC AGCTGTGCTA TCTCATTGTG GTTTTCAATT GTAACCACAT TGTGGTTTCA 1020 
159 ATGTGTTTAC TTGCCATCTG TAGATCTTCT TTGCGTGAGG TGTCTGTTCA GATGTGTGTG 1080

160 CATTTCTTGN NTTTNGGCTG TTTAACTTAT TGTTTAGTTT TAATAATTTT TTATATATTT 1140 

161 GAAGACAAAT CTTTCTCAGA TGTGTATTTG CAAATATTTC TTCAATATGA GGCTTGCTTT 1200 

162 TGTCTCTAAC AAGGTCTCTT CAGAGATAAC TTAAATATAA GAAATCCACA CTGTCACTTC 1260

163 TTTTGTGTAT ATCTACCTTT TGTGTCATTT GTTAAAATTC ATTACCAAAC CCAAAGGCAG 1320 

164 ATAGCTTTTC TTCTATTGTT TCTTCTAGAA ATTTGTATAG TTTTGCATTT TTAGTGTAAG 1380 

165 GATGATTTTG AGTGATTATT TGTGTAAGTT GTAAAGTTTT CGTCTATATC CATATCATTT 1440 

166 CTTATGGTTT CCAATTAATC GTTCCCTCAC TATTTTTGGG AAAGACACAG GATAGTGGGC 1500 

167 TTTGTTAGAG TAGATAGGTA GCTAGACATG AACAGGAGGG GGCCTCCTGG AAAAGGGAAA 1560 

168 GTCTGGGAAG GCTCACCTGG AGGACCACCA AAAATTCACA TATTAGTAGC ATCTCTAGTG 1620

169 CTGGAGTGGA TGGGCACTTG TCAATTGTGG GTAGGAGGGA AAAGAGGTCC TATGCAGAAA 1680 

170 GAAACTCCCT AGAACTCCTC TGAAGATGCC CCAATCATTC ACTCTGCAAT AAAAATGTCA 1740 

171 GAATATTGCT AGCTACATGC TGATAAGGNN AAAGGGGACA TTCTTAAGTG AAACCTGGCA 1800 

172 CCATAAGTAC AGATTAGGGC AGAGAAGGAC ATTCAAAAGA GGCAGGCGCA GTAGGTACAA 1860

173 ACGTGATCGC TGTCAGTGTG CCTGGGATGG CGGGAAGGAG GCTGGTGCCA GAGTGGATTC 1920 

174 GTATTGATCA CCACACATAT ACCTCAACCA ACAGTGAGGA GGTCCCACAA GCCTAAGTGG 1980 

175 GGCAAGTTGG GGAGCTAAGG CAGTAGCAGG AAAACCAGAC AAAGAAAACA GGTGGAGACT 2040 

176 TGAGACAGAG GCAGGAATGT GAAGAAATCC AAAATAAAAT TCCCTGCACA GGACTCTTAG 2100 

177 GCTGTTTAAT GCATCGCTCA GTCCCACTCC TCCCTATTTT TCTACAATAA ACTCTTTACA 2160 

178 CTGTGTTTCT TTTCAATGAA GTTATCTGCC ATCTTTGTAT TGCCTCTTGG TGAAAATGTT 2220 
179 TCTTCCAAGT TAAACAAGAA CTGGGACATC AGCTCTCCCC AGTAATAGCT CCGTTTCAGT 2280

180 TTGAATTTAC AGAACTGATG GGCTTAATAA CTGGCGCTCT GACTTTAGTG GTGCAGGAGG 2340

181 CCGTCACACC GGGACCAAGA GTGCCCTGCC TAGTCCCCAT CTGCCCGCAG GTGGCGGCTG 2400 

182 CCTCGACACT GACAGCAATA GGGTCCGGCA GTGTCCCCAG CTGCCAGCAG GGGGCGTACG 2460 

183 ACGACTACAC TGTGAGCAAG AGGGCCCTGC AG 2492
184 

185 (2) INFORMATION FOR SEQ ID NO: 4: 

186 

187 (i) SEQUENCE CHARACTERISTICS: 

188 (A) LENGTH: 28 base pairs 

189 (B) TYPE: nucleic acid 

190 (C) STRANDEDNESS: single

191 (D) TOPOLOGY: linear 
192 

193 (ii) MOLECULE TYPE: Genomic DNA 

194 (iii) HYPOTHETICAL: NO 

195 (iv) ANTI-SENSE: NO 

196 (v) FRAGMENT TYPE:

197 (vi) ORIGINAL SOURCE: 

198 (ix) FEATURE: 
199 

200 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

201 

202 GGGGAATTCA TTGGGATGTT TCAGTTGA 28 
203 

204 (2) INFORMATION FOR SEQ ID NO: 5: 

205

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CGAAAGTCCC CCCTAGGAGA TCTTAAGGA 29 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: RNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CCGCTTAATA CTCTGATGAG TCCGTGAGGA CGAAACGCTC TCGCACC 47 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 



PAGE: 1 SEQUENCE VERIFICATION REPORT DATE: 06/23/98

PATENT APPLICATION US/09/096,648 TIME: 07:42:11

INPUT SET: S26847.raw



Line Error Original Text 



